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We previously developed NetPath as a resource for comprehensive manually curated signal transduction pathways. The 
pathways in NetPath contain a large number of molecules and reactions which can sometimes be difficult to visualize or 
interpret given their complexity. To overcome this potential limitation, we have developed a set of more stringent curation 
and inclusion criteria for pathway reactions to generate high-confidence signaling maps. NetSlim is a new resource that 
contains this 'core' subset of reactions for each pathway for easy visualization and manipulation. The pathways in NetSlim 
are freely available at http://www.netpath.org/netslim. 

Database URL: www.netpath.org/netslim 



Introduction 

Advances in methods for the detection and identification 
of proteins, particularly their post-translational modifica- 
tions (PTMs) and protein-protein interactions (PPIs) have 
contributed enormously to our understanding of signal 
transduction. However, such information available in the 
literature is heterogeneous with respect to molecules 
and their reactions in various signal transduction pathways. 



The spatiotemporal organization and behavior of proteins 
still remains a sizeable challenge to the scientific commu- 
nity. Although these concerns have yet to be fully ad- 
dressed, currently available signaling reactions do provide 
a basic framework to investigate the molecular mechanisms 
and dynamics of signal transduction events. Indeed, com- 
posite maps of these signaling events are often used to 
visualize, hypothesize and to develop mathematical 
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models to predict the behaviors of normal and diseased 
cells (1, 2). Moreover, these signaling maps can also be over- 
laid with biological data from various high-throughput ex- 
perimental platforms to obtain an integrated view of 
biological processes (3, 4). 

NetPath (http://www.netpath.org) is a publicly available 
resource of comprehensively and manually curated signal 
transduction pathways (5) developed previously by our 
group using an open-source pathway annotation tool, 
PathBuilder (6). NetPath already includes numerous mol- 
ecules and reactions which are not included in pathways 
or signaling maps available elsewhere (7). It can be 
argued that although all of the reactions in NetPath are 
derived from the literature, many of the pathways in 
NetPath are already too unwieldy because of the large 
number of molecules in each pathway and the large 
number of reactions. The Gene Ontology consortium de- 
veloped GO Slim as a way to generate a list of core GO 
terms (8). Here, we applied more stringent criteria to path- 
ways in NetPath to develop a similar 'slim' version of path- 
ways containing only high-confidence reactions as a means 
to generate less complex and more easily visualized version 
of individual signaling pathways. These newly generated 
'slim' signaling maps, designated NetSlim, are freely avail- 
able through http://www.netpath.org/netslim. 

Development of NetSlim 

NetPath currently contains a number of cancer and immune 
signaling pathways whose reactions are available in stand- 
ard formats including proteomics standards initiative — mo- 
lecular interactions (PSI-MI) and biological pathway 
exchange (BioPAX) formats (5, 9-12). The pathway reac- 
tions are categorized as: (i) PPIs comprising both binary 
and complex interactions; (ii) enzyme-substrate reactions 
comprising both direct and induced catalyzed reactions; 
(iii) transport events that specify movement of molecules 
across distinct subcellular compartments; and (iv) differen- 
tial gene expression coupled to transcriptional regulators, if 
known. Each reaction is linked directly to the published lit- 
erature and information pertaining to the host organism, 
cell-type and subcellular location. 

Criteria for NetSlim 

In generating NetSlim, we applied the following inclusion 
criteria to extract a high confidence set of reactions from 
pathways in NetPath: (i) reactions supported by at least two 
research articles; (ii) reactions shown to occur in multiple 
cell types (iii) PPIs reported through high-throughput ana- 
lyses, as well as induced PTMs identified using mass spec- 
trometry are included if they are supported by any other 
experimental method; (iv) components of well established 
protein complexes even if they do not meet other criteria; 
and (v) molecules or reactions described in expert review 



articles. Schematic representation of the NetSlim criteria is 
provided in Figure 1 . Reactions that fulfilled one or more of 
the above criteria were included in NetSlim. 

Graphical representation of NetSlim 

The signaling pathway maps of NetSlim were generated 
using the PathVisio tool which facilitates editing and dis- 
play of biological pathways (13). PathVisio is an improved 
visualization tool which incorporates all the features of 
GenMAPP (14). All molecular reactions that occur down- 
stream of activated receptors are represented by solid 
edges, which depict direct interactions/reactions, or by 
dashed edges, which depict indirect/induced events. 
Signaling nodes are depicted based on the ability of mol- 
ecules to interact with their partners, and on the availabil- 
ity of protein regulators for cross-talk with other signaling 
modules. The topology of the pathway, which determines 
whether a given molecule lies upstream or downstream 
from another reference molecule, is generally dependent 
on the biological information obtained from studies using 
mutants or inhibitors of specific molecules. The edges rep- 
resenting various PPIs, enzymatic reactions or protein trans- 
location events are distinguished from each other by 
distinct colors. NetSlim maps with detailed legends are 
available at http://www.netpath.org/netslim. Pathway 
maps with nodes hyperlinked to specific NetPath molecule 
page and the edges linked to their corresponding citations 
can be obtained by clicking on 'map with citation' option in 
the NetSlim pathway pages. The browsing features and the 
information provided in NetSlim are shown in Figure 2 with 
TGF beta pathway as an example. 

NetSlim map of the TGF-p pathway as a model 

The signal transduction mechanisms underlying the patho- 
physiological activities of transforming growth factor-p 
(TGF-p) have been extensively studied since its discovery 
nearly 30 years ago. Mammals express three TGF-p isoforms 
(i.e. TGF-pi, TGF-P2 and TGF-P3) that are encoded by dis- 
tinct genes in a tissue-specific and developmentally regu- 
lated manner (15). In addition, numerous studies have 
clearly established TGF-p as a multifunctional cytokine 
that plays essential roles in regulating virtually all aspects 
of mammalian development and differentiation, and in 
maintaining mammalian tissue homeostasis (15-18). Even 
more remarkably, malignant cells have evolved a variety 
of complex mechanisms capable of circumventing the 
tumor suppressing activities of TGF-p, and in doing so, typ- 
ically convert the functions of TGF-p to that of a tumor 
promoter, particularly the induction of carcinoma epithe- 
lial-mesenchymal transition, invasion, and dissemination to 
distant organ sites (15-18). This peculiar conversion in 
TGF-p function is known as the 'TGF-p Paradox', which 
underlies the lethality of TGF-p in metastatic cancer cells. 
Thus, elucidating the effectors and signaling modules 
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Figure 1. Schematic representation of the criteria used to select high-confidence pathway reactions from NetPath for NetSlim. 
Reactions supported by at least two research articles are considered confident. Reactions shown to occur in multiple cell types 
for specific pathways are also included. NetSlim further incorporates reactions reported through high-throughput analyses 
which were validated by a focused experiment (in the same research article or another research article). The interactions 
between components of known protein complexes are included to obtain a general view of a pathway. Molecules or reactions 
established through expert review articles are considered for NetSlim in order to provide the core set of reactions in a given 
pathway. 



activated by TGF-p may offer new insights into the devel- 
opment of novel neoadjuvants capable of effectively tar- 
geting the TGF-p pathway to significantly improve the 
clinical course of patients with cancer, fibrosis or immuno- 
logic disorders. 

TGF-p is secreted from cells as a latent homodimeric poly- 
peptide that becomes tethered to the extracellular matrix 
by latent-TGF-p-binding proteins. Mature TGF-p isoforms 
are activated and liberated from extracellular matrix 
depots by a variety of mechanisms, including proteolysis, 
reactive oxygen species, changes in pH and physical inter- 
actions with integrins, thrombospondin-1 or SPARC (19-22). 
Once activated, mature TGF-p initiates transmembrane 



signaling by binding to two distinct transmembrane Serf 
Thr protein kinases, termed TGF-p type I (TpR-l) and type 
II (TpR-ll) receptors (23-25). In some cells and tissues, TGF-p 
also binds to a third cell surface receptor, TGF-p type III 
(TPR-III), which transfers TGF-p to TpR-ll and TpR-l (26). 
Full activation of these cytokine:receptor ternary complexes 
transpires upon TpR-ll-mediated transphosphorylation and 
activation of TpR-l (27, 28), which then phosphorylates and 
activates the latent transcription factors, Smad2 and Smad3 
(29, 30). Afterward, phosphorylated Smad2/3 interact phys- 
ically with Smad4, with the resulting heterotrimers trans- 
locating into the nucleus to regulate the expression of 
TGF-p-responsive genes (29, 30). These Smad-dependent 
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Figure 2. An overview of NetSlim. The NetSlim home page has options to browse the pathway maps. A description of the criteria 
used for NetSlim is available as a separate tab. Choosing a pathway such as the TGF beta signaling pathway, the 'Pathway map' 
tab provides access to the TGF-p NetSlim pathway map image and the 'Description' tab which has a brief description of the 
pathway. The 'map with citation' provided in the NetSlim pathway page contains molecules linked to specific NetPath molecule 
pages and their reactions linked to their corresponding citations. NetSlim maps can be downloaded in different file formats from 
specific pathway pages. The NetSlim data is also available to download in BioPAX, PSI-MI and SBML formats. A 'Feedback' page 
allows users to provide comments about the NetSlim resource. A link to NetSlim is provided on each pathway page in NetPath 
and wee versa. 



events are subject to fine-tuning and crosstalk regulation in 
the cytoplasm by their interaction with a variety of adapter 
molecules, including SARA, Hgs, PMLand Dab2 (31-34), and 
with Smad7 (35, 36), whose inhibitory activity is modulated 
by STRAP, AMSH2 and Arkadia (37-39); and in the nucleus 
by their interaction with a variety of transcriptional activa- 
tors and repressors that occur in a gene- and cell-specific 
manner (40). In addition to activating canonical Smad2/ 
3-dependent signaling, accumulating evidence clearly 
links the development of a variety of human pathologies 
to aberrant coupling of TGF-p to its noncanonical effector 
molecules. Included in this ever expanding list of non- 
canonical signaling molecules stimulated by TGF-p are 
PI3K (41), AKT (42), mTOR (43), integrins (44) and focal 
adhesion kinase (45), and members of the MAP kinase 



[e.g. ERK1/2 (46), JNK (47) and p38 MAPK (48)] small 
GTP-binding proteins (e.g. Ras, Rho and Raci) (18). 

The TGF-p-signaling pathway constructed in NetPath cur- 
rently contains 226 proteins that participate in 482 reac- 
tions comprised of 340 molecular association events, 105 
enzymatic events and 37 translocation events. Graphical 
representation of these molecules in their entirety is quite 
cumbersome, thus making their meaningful interpretation 
difficult for biologists users of the NetPath resource. 
Further many NetPath molecules and their reactions are 
supported by either single publications or derived from 
high-throughput experiments lacking secondary experi- 
mental validation. While the entire network is useful, espe- 
cially to gain coverage in gene expression analyses, users 
often want to see only the high confidence regions of a 
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pathway and this is currently difficult to construct. We have 
now involved experts from the field to contribute their 
knowledge to NetSlim with regard to the inclusion or ex- 
clusion of molecules in high confidence datasets. For in- 
stance, the presented NetSlim model of TGF-p pathway 
possesses 121 proteins involved in 252 reactions with 173 
molecular associations, 60 enzyme catalysis reactions and 
19 translocation events, almost half the size of the full 
NetPath pathway. Review articles contributed to the inclu- 
sion of ~30% of the reactions in TGF beta NetSlim pathway 
map. There were many reactions which were derived from 



high-throughput experiments but were not included be- 
cause no further validation was conducted. 

As such, this TGF-p NetSlim model represents the 'core 
TGF-p pathway' parsed from the less stringent and all inclu- 
sive resource, NetPath. Despite its trimmed appearance, our 
NetSlim model still presents more molecules and reactions 
as compared to major publicly available pathway resources 
containing the same pathway, such as Kyoto Encyclopedia 
of Genes and Genomes (KEGG) (49), National Cancer 
Institute's Pathway Interaction Database (PID) (50), 
Science's Signal Transduction Knowledge Environment 



TGF Beta Signal Transduction Pathway 




Figure 3. TGF-p pathway in NetSlim. The TGF-p pathway map represents the molecular reactions induced by the binding of TGF-p 
1 to TGF-p receptor complex. The annotated NetSlim pathway for TGF-p possesses 121 proteins involved in 252 reactions with 173 
molecular associations, 60 enzyme catalysis reactions and 19 translocation events. The nodes and edges represent the molecules 
and their reactions, respectively. A key for the various symbols, colors and abbreviations used in the pathway diagram is 
provided. 
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(STKE) Connections Maps (51), Reactome (52) or PANTHER 
(53). Some of the pathway resources developed by commer- 
cial entities have been customized to better reflect their 
available product line and reagents, thus presenting a 
skewed and distorted view of the underlying signal 



transduction pathways. Our new high confidence map of 
the TGF-p signaling system is shown in Figure 3. The 
NetSlim criteria have greatly diminished the complexity of 
the current pathway map and we hope that they will prove 
useful to the community. 
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Figure 4. Comparative statistics of molecules and reactions in NetPath and NetSlim. The bar diagram represents the total number 
of molecules (A) and reactions (B) in signaling pathways in NetSlim and NetPath. 
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Impact of NetSlim criteria 

The NetSlim criteria used to extract high confidence 
reactions from NetPath have considerably reduced the 
complexity of the pathway maps. However, as mentioned 
above, these pathway maps contain more molecules and 
reactions than other pathway resources available in the 
public domain. We have captured reactions described in 
the review articles while many others are still not included 
in reviews of the respective pathways in spite of being 
high confidence. The reactions identified in multiple cell 
types could be captured from single or multiple research 
publications and contribute to the list of high-confidence 
reactions. A comparison between the total number of mol- 
ecules and the reactions in NetSlim and NetPath is provided 
in Figure 4. 

Data availability 

Twenty NetSlim maps are currently available, one for each 
currently available NetPath pathway. NetSlim maps are 
freely available to download in .gpml and .mapp formats 
to facilitate overlay of gene or protein expression data. The 
NetSlim data is available to download in BioPAX, PSI-MI 
and SBML formats which can be used for the visualization 
and analysis using software such as Cytoscape (54), 
Ingenuity pathway analysis. Visualization and layout ser- 
vices for BioPAX pathway models (VISIBIOweb) (55) or 
Chisio BioPAX Editor (ChiBE) (56). The NetSlim maps are 
also available in PNG, and PDF formats for visualization 
and presentation. All data is made available under the 
Creative Commons license version 2.5 (57). 

Future directions 

Gene Ontology terms will be provided whenever functional 
significance of the nodes or edges are assayed using inhibi- 
tors, mutants, knockouts or silencing studies. We anticipate 
making both NetPath and NetSlim versions available for 
each pathway as they are curated by our team. We are 
also planning on making the pathway maps available in 
the Systems Biology Graphical Notation (SBGN) (58) 
format in the near future. Finally, we will keep this resource 
somewhat flexible so that the criteria can be updated de- 
pending upon input from the community and pathway 
experts. 

Conclusion 

Expanding application and increasing demand of various 
high-throughput techniques in proteomics have greatly 
accelerated the pace by which mechanistic details of signal 
transduction pathways are uncovered. Given the enormity 
of molecular reactions in the context of pathways already 



described to date, it is clear that we need to initiate sys- 
tematic efforts at developing comprehensive pathway 
resources. NetSlim is an initial step to develop high- 
confidence pathway-specific reaction maps. We hope to in- 
volve the biomedical community as a means to improve the 
quality and accuracy of pathways in NetSlim and to keep 
them updated on an ongoing basis. We hope to establish a 
model where NetPath and NetSlim versions of individual 
signaling pathways will be curated centrally by our team 
and published in peer-reviewed journals with pathway 
authorities and other contributing biologists from the com- 
munity as coauthors. Finally, we believe that NetSlim will 
serve the biomedical community in directing and undertak- 
ing novel pathway-based systems biology approaches that 
have hitherto not been possible. 
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